Search CORE

178 research outputs found

A Case for Staged Database Systems

Author: Ailamaki Anastassia
Harizopoulos Stavros
Publication venue
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

QPipe: A Simultaneously Pipelined Relational Query Engine

Author: Ailamaki Anastassia
Harizopoulos Stavros
Shkapenyuk Vladislav
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

DBmbench: fast and accurate database workload representation on modern microarchitecture

Author: Ailamaki Anastassia
Falsafi Babak
Shao Minglong
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Accelerating Database Operations Using a Network Processor

Author: Ailamaki Anastassia
Falsafi Babak
Gold Brian T.
Huston Larry
Publication venue
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

DBMSs on a Modern Processor: Where Does Time Go?

Author: Ailamaki Anastassia
DeWitt David J.
Hill Mark D.
Wood David A.
Publication venue
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Temporal Streaming of Shared Memory

Author: Ailamaki Anastassia
Falsafi Babak
Hardavellas Nikos
Kim Jangwoo
Somogyi Stephen
Wenisch Thomas F.
Publication venue
Publication date: 16/10/2007
Field of study

Coherent read misses in shared-memory multiprocessors account for a substantial fraction of execution time in many important scientific and commercial workloads. We propose Temporal Streaming, to eliminate coherent read misses by streaming data to a processor in advance of the corresponding memory accesses. Temporal streaming dynamically identifies address sequences to be streamed by exploiting two common phenomena in shared-memory access patterns: (1) temporal address correlation — groups of shared addresses tend to be accessed together and in the same order, and (2) temporal stream locality — recently- accessed address streams are likely to recur. We present a practical design for temporal streaming. We evaluate our design using a combination of trace-driven and cycle- accurate full-system simulation of a cache-coherent distributed shared-memory system. We show that temporal streaming can eliminate 98% of coherent read misses in scientific applications, and between 43% and 60% in database and web server workloads. Our design yields speedups of 1.07 to 3.29 in scientific applications, and 1.06 to 1.21 in commercial workloads

Infoscience - École polytechnique fédérale de Lausanne

Database Servers on Chip Multiprocessors: Limitations and Opportunities

Author: Ailamaki Anastassia
Falsafi Babak
Hardavellas Nikos
Johnson Ryan
Mancheril Naju
Pandis Ippokratis
Publication venue
Publication date: 10/10/2007
Field of study

Prior research shows that database system performance is dominated by off-chip data stalls, resulting in a concerted effort to bring data into on-chip caches. At the same time, high levels of integration have enabled the advent of chip multiprocessors and increasingly large (and slow) on-chip caches. These two trends pose the imminent technical and research challenge of adapting high-performance data management software to a shifting hardware landscape. In this paper we characterize the performance of a commercial database server running on emerging chip multiprocessor technologies. We find that the major bottleneck of current software is data cache stalls, with L2 hit stalls rising from oblivion to become the dominant execution time component in some cases. We analyze the source of this shift and derive a list of features for future database designs to attain maximum performance

Infoscience - École polytechnique fédérale de Lausanne

MultiMap: Preserving disk locality for multidimensional datasets

Author: Ailamaki Anastassia
Ganger Gregory R.
Papadomanolakis Stratos
Schindler Jiri
Schlosser Steven W.
Shao Minglong
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/01/2009
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Memory coherence activity prediction in commercial workloads

Author: Ailamaki Anastassia
Falsafi Babak
Hardavellas Nikolaos
Kim Jangwoo
Somogyi Stephen
Wenisch Thomas F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 23/01/2009
Field of study

Recent research indicates that prediction-based coherence optimizations offer substantial performance improvements for scientific applications in distributed shared memory multiprocessors. Important commercial applications also show sensitivity to coherence latency, which will become more acute in the future as technology scales. Therefore it is important to investigate prediction of memory coherence activity in the context of commercial workloads.This paper studies a trace-based Downgrade Predictor (DGP) for predicting last stores to shared cache blocks, and a pattern-based Consumer Set Predictor (CSP) for predicting subsequent readers. We evaluate this class of predictors for the first time on commercial applications and demonstrate that our DGP correctly predicts 47%-76% of last stores. Memory sharing patterns in commercial workloads are inherently non-repetitive; hence CSP cannot attain high coverage. We perform an opportunity study of a DGP enhanced through competitive underlying predictors, and in commercial and scientific applications, demonstrate potential to increase coverage up to 14%

Infoscience - École polytechnique fédérale de Lausanne

Store-Ordered Streaming of Shared Memory

Author: Ailamaki Anastassia
Falsafi Babak
Gniady Chris
Hardavellas Nikolaos
Kim Jangwoo
Somogyi Stephen
Wenisch Thomas F.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 23/01/2009
Field of study

Coherence misses in shared-memory multiprocessors account for a substantial fraction of execution time in many important scientific and commercial workloads. Memory streaming provides a promising solution to the coherence miss bottleneck because it improves memory level parallelism and lookahead while using on-chip resources efficiently. We observe that the order in which shared data are consumed by one processor is correlated to the order in which they were produced by another. We investigate this phenomenon and demonstrate that it can be exploited to send Store- ORDered Streams (SORDS) of shared data from producers to consumers, thereby eliminating coherent read misses. Using a trace-driven analysis of all user and OS memory references in a cache-coherent distributed shared- memory multiprocessor, we show that SORDS based memory streaming can eliminate between 36% and 100% of all coherent read misses in scientific workloads and between 23% and 48%in online transaction processing workloads

Infoscience - École polytechnique fédérale de Lausanne